Lower bound on the number of Toffoli gates in a classical reversible circuit through 

quantum information concepts 
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The question of finding a lower bound on the number of Toffoli gates in a classical reversible 
circuit is addressed. A method based on quantum information concepts is proposed. The method 
involves solely concepts from quantum information - there is no need for an actual physical quantum 
computer. The method is illustrated in the example of classical Shannon data compression. 



In the past ten years we have witnessed the birth and 
explosive growth of the field of quantum information and 
computation. The main thrust of this new field is to 
study how quantum systems (such as quantum comput- 
ers and quantum communication devices) can be used 
to solve certain mathematical problems or to improve 
communication capabilities. The crucial feature of this 
approach is that although the quantum systems them- 
selves can be studied with pen and paper, gains are ob- 
tained only when the quantum systems are actually used 
in practice. The gains are due to new physical behavior 
unique to quantum systems and not shared by classical 
ones. In this Letter we take a different direction. We are 
not interested in using quantum systems, rather we want 
to use the concepts and insights gained in the study of 
quantum information for solving mathematical problems. 

The problem we consider here concerns lower bounds 
on reversible classical circuits. Although reversible clas- 
sical computation will probably not be realized in the 
forseeable future (though increasing attention is being 
devoted to this issue), its study has yielded profound 
insights into the theory of complexity and into thermo- 
dynamics; see for a review. 

A reversible classical computation evaluates a function 
/ which takes n-bit input x E {0, 1}™ to n-bit output 
f{x) £ {0, 1}™. Each particular input has its own unique 
output, thus / is invertible. A classical circuit that eval- 
uates / can be reduced to a sequence of elementary re- 
versible logical gates. Examples of reversible one-, two- 
and three-bit gates are NOT, Controlled-NOT (C-NOT) 
and Toffoli (Controlled-Controlled-NOT) gates. C-NOT 
applies NOT on the second bit only if the value of a first 
bit is 1; Toffoli applies NOT on the third bit only if the 
values of both first and second bit are 1. Reversible one- 
and two-bit gates do not constitute a universal set of 
gates. The Toffoli gate however is a universal basic gate 
for reversible classical computation, i.e., any reversible 
classical circuit can be built up from Toffoli gates @. 

Although we can build any reversible circuit out of 
Toffoli gates alone, an interesting conceptual question is 
to find the minimal number of Toffoli gates required if 
one allows for any number of one- and two-bit gates. 



The problem is interesting because Toffoli gates are, in 
a sense, the strongest reversible gates, and the minimal 
number needed tells us about the complexity of the com- 
putation itself. Furthermore, Toffoli gates require physi- 
cal interaction between three bits, and are therefore more 
difficult to implement in practice than one- and two-bit 
gates, and it might be useful to minimize their use. 

We formulate the problem as follows: given a reversible 
function f(x) what is the minimum number of Toffoli 
gates needed to construct a circuit that will evaluate f(x) 
for every x or only for a certain subset of x. 

As far as we know, a systematic approach to this prob- 
lem does not exist. In this paper we use quantum infor- 
mation concepts to address it. In quantum information 
(computation) one classical bit can be encoded in two 
orthogonal states of a quantum system. The main idea 
of our method is to map the bits onto some special quan- 
tum states, and the action of the logic gates onto unitary 
transformations acting on these states. Then, the study 
of the properties of the unitary transformation that is 
associated with the classical reversible computation will 
give information about the classical circuit. The map is 
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The states |0) and |1), the "logical" qubits into which 
the classical bits are mapped, represent entangled states 
of two "constituent" qubits, denoted by the indexes A 
and B. (Throughout this Letter we will use boldfaced 
fonts for the logical qubits and normal fonts for the con- 
stituent qubits.) Here the states |0),|1) are associated 
with orthogonal states of a two-level quantum system. 

A string of n bits is mapped on the associated quantum 
state of n qubit pairs: X1X2 ■ ■ ■ x n — > |xi)|x2) . . . |x n ). 
Any computation x\X2---x n — > fi(x)f2{x)...f n (x) 
is mapped on the same transformation of the 
corresponding quantum states |xi)|x2) . . . |x n ) — > 
|fi(x))|5j(x)) . . . |f n (x)). Since we consider a reversible 
classical computation the corresponding quantum trans- 
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formation is unitary (or part of a unitary if we we 
are interested only in a partial truth table). For ex- 
ample the action of a C-NOT gate a, b — > a, a ® b 
(a, b G {0, 1}) is mapped on the unitary transformation 
Uc-not- |a)|b) -► |a)|a8b). 

The most important property of this mapping is that 
any classical reversible circuit built only from one- and 
two-bit gates is mapped onto a transformation that re- 
quires neither entanglement nor classical communication, 
ic. onto a local unitary transformation U = U A <S> U B 
where U A {U B ) acts only on the A(B) constituent q-bits. 
For example the Uc-not gate can be built from local 
C-NOT gates: 

Uc-not = U A _ NOT ® U B _ NOT (2) 

as can be easily checked explicitly. Here Uc-not 
not) i s a C-NOT gate acting on the A(B) con- 
stituent q-bits in the opposite direction to normal, i.e. 
with the second bit as the control and the first bit as 
the target. (Such bi-lateral transformations were con- 
sidered in 3] for the purpose of density matrix purifica- 
tion) . That the quantum equivalent of any reversible one 
or two-bit gates can be constructed by a similar local bi- 
lateral transformation can be verified explicitly. Hence, 
any circuit built from one or two q-bit gates is local. 

We can use this property of our mapping to analyze 
general circuits: Given a classical reversible computa- 
tion we construct the associated quantum unitary trans- 
formation U ; if U is non-local, the corresponding classi- 
cal transformation cannot be constructed solely by two-bit 
gates. Furthermore, the amount of non-locality in U gives 
a lower bound on the number of Toffoli gates we need. 

We define Ejj, the amount of non- locality in U, as 
the minimum amount of entanglement required to im- 
plement U using only local operations and classical com- 
munication (LOCC). We denote by Et the amount of 
non-locality of the quantum Toffoli gate Ut (we estimate 
Et below). 

One possible implementation of U is to realize the clas- 
sical circuit using the non-local quantum Toffoli gates 
(which cost Et ebits) and the local 2-bit and 1-bit gates. 
Hence U can be implemented using NtEt ebits. This 
yields the lower bound: 

N T > Eu/E T . (3) 

We now arrive at the crucial point of the method. To 
determine Ejj may be a very complicated task - it might 
actually be as complicated as directly determining the 
required number of Toffoli gates. On the other hand, 
it is easier to obtain bounds on Ejj. Upper bounds are 
obtained by explicitly describing an implcmcntion U and 
calculating the amount of entanglement required to carry 
out this implementation using LOCC. Equation (J3J) is 
derived from just such an upper bound. 



To obtain lower bounds on Ejj we apply U on a test 
stated): 

umT) = \<ui), (4) 

where the test state can be any arbitrary superposition 
of basic input states |xi)|x2) . . . |x„). We denote the 
amount of non-locality between A and B possessed by 
\^ln Bt ) and |$*^f/) by E\* st and E^ respectively, where 
E = S(Tr A \V}(y\) = S(Tr B \V)(y\) is the von Neumann 
entropy of the reduced density matrix. (Applying U to 
the test state and computing E^ st and E 1 ^ is straight- 
forward). The amount of non-locality in U is not less 
than the entanglement difference between the two states: 

E v >\EZ st -E^\. (5) 

How good are these bounds on A^? First of all note 
that any test state leads to a lower bound. However, 
different test states may lead to different lower bounds 
because the non-local content of U may not be realized 
in full when U acts on a particular state. (For exam- 
ple, a test state of the form l*-^*) = |xi)|x 2 ) . . . |x n ) is 
transformed into |^^*) = | f x ) 1 f 2 ) • • ■ |fn) and leads to no 
increase in entanglement). Good test states can be found 
either by trial and error, or by systematic optimization, 
although it is unclear whether the method of test states 
can provide tight bounds on Ejj. 

A more important restriction is due to the fact that Eq. 
© can be far from tight. This is because when imple- 
menting the classical circuit some of the Toffoli gates may 
increase the entanglement whereas others may decrease 
it. Thus there may be more efficient ways of implement- 
ing U than realizing the classical circuit. For instance 
if U acts on states composed of n logical qubits, then 
Ejj < In, because one can always implement U by tele- 
porting Alice's qubits to Bob, letting Bob implement U 
locally, and teleporting Alice's qubits back to her. This 
shows that our method can only provide bounds that 
grow linearly in n. On the other hand it is known that 
for some problems of classical reversible computation the 
number of Toffoli gates grows exponentially |Q and for 
these problems our method is very inefficient. Neverthe- 
less we expect that in many cases the number of Toffoli 
gates will grow linearly with, or as a fractional power of, 
n. Bounding the actual power may give an interesting - 
indeed, sometimes fundamental - insight. 

Let us now consider the case of the Toffoli gate itself 
and prove the basic fact that classical Toffoli gates cannot 
be built from reversible two-bit gates. We will do this by 
showing that under our map the quantum equivalent of 
the Toffoli gate is non-local. Specifically we will obtain 
the upper and lower bounds 

1 < E T < 2 . (6) 

Note that it is not essential for our method to find the 
exact value of Et since in general we are interested only 
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FIG. 1: Implementation of nonlocal Toffoli using 2 ebits as a 
resource. 



in the scaling of the number of Toffoli gates with the size 
of the problem. 

The lower bound Et > 1 is obtained by showing that 
under our map the quantum Toffoli gate is capable of 
producing at least one ebit of entanglement. Consider 
the test state 

l*& rt > = |(|0) 1 |0) 2 |0) 3 + |l)i|0) 2 |0} 3 + |0)i|l) 2 |0) 3 
-|1) 1 |1) 2 |1) 3 ) = -L|001)a|001) b + ^= (|000)a|000) b 



+|010) A |010) fl + |100) A |100) fl - |110) A |110) B 1 , 

where the third logical bit is the target of the Toffoli gate. 
After acting with Ut on l^f^ 3 ') we obtain 



e t t > = ^(io)iio) 2 |o) 3 



|l)i|0) 2 |0) 3 + |0) 1 |l) 2 |0) 



|l>i|l>a|0) 3 ) = 



1 



|000) a |000) b + |001)a|001) j 



+|010) a |0I0> b + |100)a|100) b + |011)a|011) b 
+|101)a|101) b - |110)a|H0)b - |111)a|1H)b 



The Schmidt coefficients are found to be {oti} — 
si I I I I n n rU and sn\ = (llllllll\ 

12» 8' 8' 8' 8' ' ' J XyiS 18'8'8'8'8'8'8'8J 

respectively. Hence E\^ st = 2 ebits and E^ = 3 ebits 
and Et > 1. (In passing we note that the quantum Tof- 
foli gate cannot be implemented without classical com- 
munication: if such an implementation were possible it 
would violate relativistic causality). 

To obtain the upper bound Et < 2 we will describe 
explicitly a method for realising the quantum map of 
the Toffoli that requires 2 ebits. Consider three pairs 
of qubits on which we are going to apply Ut, where 
the states are control and |0) 3 is a target 

(see Fig. It is convenient to analyze in parallel the 
cases where is |0)i or |l)i. The two parties start 
by performing local Hadamard rotations Ha(b) [acting 
as H\0) = ^=(|0) + |1», H\l) = ^=(|0) - |1))] of A, and 
B\ of the first pair, obtaining 



|0)i - |0')i = 4(|0)f |0)f + |i)f|i)f), 

f (7) 

|l)i - |l')i = ^ f (|0) 1 1 |l)f + |i)f|o)f). 

Then the parties proceed by performing local Toffoli 
gates on their particles, which can be written as 



u 



A(B) 



= |0)(0|i<8>/ 2 ®/3 + |lXl|i®E/23, (8) 



where C/ 23 is a local C-NOT between particles 3 and 2 
(with particle 3 as the control and particles 2 as the tar- 
get). As a result the initial states evolve to: 

|o')i|*) 2 |©) 3 - ^(|o)r|o)f + |i) r |i)f £4t/ 2 B 3 )|*> 2 |0>3, 
|i')i|*) 2 |0> 3 - ^(|o) r |i)f ug + |i)f |o)f c/&)|*) 2 |e> 3 . 

Next they swap the states of A\ and B\. This opera- 
tion utilizes two ebits and can be performed using two 
ordinary teleportations in both directions. This yields 



-^(|o)f|o)f + |itf|i>f E^)|*> 2 |e> 3 , 



i 
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^(|l) J 4 |0)f U 2 % + |0)f|l)f C^)|*> a |©) 3 . 

Next, they perform |JSJ again. The resulting states are 
|0')i|*) 2 |©) 3 and |l')iC/ 2 4 3 C/ 2 B 3 |*) 2 |0) 3 . 

Finally, they apply Ha and Hg again and obtain 
|O) 1 |*) 2 |0) 3 and |1) 1 C/ 2 A 3 [/ 2 S 3 |*) 2 |0) 3 . 

As we have already noted, two local C-NOT transfor- 
mations are equivalent to a nonlocal C-NOT transfor- 
mation. Thus, from the last expression it follows that a 
nonlocal C-NOT is applied on pairs 2 and 3 (with pair 
2 as the control and pair 3 as the target) if and only if 
the state of the first pair is |1). Thus this protocol im- 
plements the nonlocal Toffoli gate and utilizes two ebits, 
which are needed to swap two states in the intermediate 
stage. Due to linearity of quantum mechanics all these 
arguments will hold also in the case of arbitrary super- 
position of initial states. 

To conclude, from Eqs. |5l 0) we obtain the fol- 
lowing lower bound on the number Nt of Toffoli gates 
required to carry out a computation 



N T > 



\El 
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(9) 



We illustrate this result on the example of Shannon data 
compression. We were led to consider this particular 
example by our research in multi-particle entanglement 
compression In fact, this is how we discovered this 
method in the first place. 
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FIG. 2: Entanglement as a function of string length n for 
p = 0.8. The number of typical strings was calculated using 
non approximated 

The method of classical compression of n-bit source- 
string of 0's and l's, where p is the probability of each 
bit to be equal 1, is based on the fact that the most 
probable (typical) strings, generated by the source will 
contain np ones when n is large U If p ^ ~ then the 
Shannon entropy of the source H(p) is smaller than 1 
and the number of typical strings, 2 nH ^ p ' , is less than the 
total number of strings 2™. Thus, a message generated 
by the source can be compressed to a shorter message. 

We consider a "Shannon compressor" - a classical re- 
versible circuit which receives as input an n bit string 
which contains np ones (i.e. a typical string) and out- 
puts a compressed version of the string in which only the 

first log («) ~ nH(p) bits carry information and the 

other n(l — H(p)) redundant bits are set to some stan- 
dard sequence, e.g. to all 0's: 

XiX 2 ...X n -> /1/2— fnH0nH+l—0n- (10) 

Our goal is to find a lower bound on the number of Tof- 
foli gates needed to build the " Shannon compressor" . We 
take the initial test state to be the uniform superposition 
of states with np ones: 

\^Tn St )=N |xi)|x 2 >...|x n ), 

£i£{0,l},^^ . xi—np 



where N — 




. The output state is: 



\Kut)=N E |fl)|f2>-|fnH)|0 nH+ l)...|0 n >. 

/i£{0,l} 

For fixed value of p we can calculate the entanglement 
of |**^*) and |*^f/). The entanglement is easy 

to calculate: it equals the number of output redundant 
pairs, i.e. E^f = n(l — H(p)). We have calculated 



E^ at using a combination of analytical and numerical 
techniques which will be described in |5|. Fig. [2]presents 
our results for p = 0.8. A linear dependance of E^ 1 — 
E^t on n is obtained. Thus, the number of Toffoli gates 
needed to perform Shannon compression grows at least 
linearly with n. For instance for p — 0.8 we need at 
least 0.2332n Toffoli gates. Inspired by our numerical 
result, Buhrman has found, using a completely different 
technique, an analytical proof of this lower bound Q. 

In summary we have addressed the problem of evaluat- 
ing the number of Toffoli gates needed to perform classi- 
cal reversible computations. We have proposed a method 
based on quantum information concepts in which strings 
of classical bits are mapped into sequences of special non- 
local quantum states and classical reversible computa- 
tions are mapped onto unitary transformations of these 
quantum states. The nonlocal properties of these trans- 
formations provide information about the classical re- 
versible computation. In particular, if the unitary trans- 
formation is nonlocal then the corresponding classical 
reversible circuit cannot be built solely from one- and 
two-bit gates. The amount of non-locality possessed by 
the unitary transformation associated with any classical 
reversible computation provides a lower bound on the 
number of Toffoli gates needed for this computation. 

As an example we considered classical Shannon com- 
pression and calculated the amount of non-locality of 
the associated unitary transformation. According to our 
numerical results, the lower bound on the number of 
Toffoli gates grows linearly with the size of the string 
n. Thus quantum methods can provide fundamental in- 
sights about classical computation. 

We hope that our approach may prove useful for other 
problems concerning classical reversible computation. 
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